Observations on Nonlinear Risk-Sensitive Control

نویسندگان

  • Stefano P. Coraluppi
  • Steven I. Marcus
چکیده

This paper clariies the relationship between risk-sensitive and robust control. This topic has received attention in the recent literature. We show that the formulations are related through an appropriate limit that is more natural than the small noise limit considered previously. The relationship we discuss is more closely related to that which has been shown in the linear systems setting. 1 System Description and Problem Formulation We consider a discrete-time stochastic system de-ned on a nite time horizon, k 2 f0 where at time k, x k 2 X is the state of the system, u k 2 U is the control, y k 2 Y is the observed output, z k 2 < is the controlled output, and where w N?1 =1) are sequences of random variables which represent disturbances to the system. We assume the initial state of the system x 0 has a known probability distribution 0 on the set of states X. The total cost Z incurred in the system's evolution is given by the sum of the controlled outputs at each time, that is: Z = N?1 X k=0 z k (4) The total cost will have a probability distribution induced by the policy , and can be parametrized by the initial distribution 0 on x 0. Based on this distribution , we can consider a number of criteria for the optimal control of the system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Information-State Approach to Risk-Sensitive Tracking Problems

In this paper we use the information-state approach to obtain solutions to risk-sensitive quadratic control problems. Speci cally we consider the case of tracking a desired trajectory. Results are presented for linear discrete-time models with Gaussian noise, and also for nite-discrete state, discrete-time hidden Markov models with continuous-range observations. These results give insight to mo...

متن کامل

Risk Sensitive, Nonlinear Optimal Control: Iterative Linear Exponential-Quadratic Optimal Control with Gaussian Noise

In this contribution, we derive ILEG, an iterative algorithm to find risk sensitive solutions to nonlinear, stochastic optimal control problems. The algorithm is based on a linear quadratic approximation of an exponential risk sensitive nonlinear control problem. ILEG allows to find risk sensitive policies and thus generalizes previous algorithms to solve nonlinear optimal control based on iter...

متن کامل

Nonlinear Discrete-time Risk-sensitive Optimal Control

This paper is devoted to the study of the connections among risk-sensitive stochastic optimal control, dynamic game optimal control, risk-neutral stochastic optimal control and deterministic optimal control in a nonlinear, discrete-time context with complete state information. The analysis worked out sheds light on the profound links among these control strategies, which remain hidden in the li...

متن کامل

Risk - Sensitive Dual Control

In this paper, we develop new results concerning the risk-sensitive dual control problem for output feedback nonlinear systems, with unknown time-varying parameters. These results are not merely immediate specializations of known risk-sensitive control theory for nonlinear systems, but rather, are new formulations which are of interest in their own right. A dynamic programming equation solution...

متن کامل

Risk-Sensitive Control and Dynamic Games for Partially Observed Discrete-Time Nonlinear Systems

In this paper we solve a finite-horizon partially observed risk-sensitive stochastic optimal control problem for discrete-time nonlinear systems and obtain small noise and small risk limits. The small noise limit is interpreted as a deterministic partially observed dynamic game, and new insights into the optimal solution of such game problems are obtained. Both the risk-sensitive stochastic con...

متن کامل

Finite- dimensional optimal controllers for nonlinear plants

Optimal risk sensitive feedback controllers are now available for very general stochastic nonlinear plants and performance indices. They consist of nonlinear static feedback of so called information states from an information state filter. In general, these filters are linear, but infinite dimensional, and the information state feedback gains are derived from (doubly ) infinite dimensional dyna...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007